Online Importance Weight Aware Updates
نویسندگان
چکیده
An importance weight quantifies the relative importance of one example over another, coming up in applications of boosting, asymmetric classification costs, reductions, and active learning. The standard approach for dealing with importance weights in gradient descent is via multiplication of the gradient. We first demonstrate the problems of this approach when importance weights are large, and argue in favor of more sophisticated ways for dealing with them. We then develop an approach which enjoys an invariance property: that updating twice with importance weight h is equivalent to updating once with importance weight 2h. For many important losses this has a closed form update which satisfies standard regret guarantees when all examples have h = 1. We also briefly discuss two other reasonable approaches for handling large importance weights. Empirically, these approaches yield substantially superior prediction with similar computational performance while reducing the sensitivity of the algorithm to the exact setting of the learning rate. We apply these to online active learning yielding an extraordinarily fast active learning algorithm that works even in the presence of adversarial noise.
منابع مشابه
Online Streaming Feature Selection Using Geometric Series of the Adjacency Matrix of Features
Feature Selection (FS) is an important pre-processing step in machine learning and data mining. All the traditional feature selection methods assume that the entire feature space is available from the beginning. However, online streaming features (OSF) are an integral part of many real-world applications. In OSF, the number of training examples is fixed while the number of features grows with t...
متن کاملComputing Environment-Aware Agent Behaviours with Logic Program Updates
The ability of reacting to changes in the external environment is of crucial importance within the context of software agents. Such feature must however be suitably reconciled with a more deliberative rational behaviour. In this paper we show how different behaviours of environment-aware agents can be naturally specified and computed in terms of logic program updates. Logic program updates are ...
متن کاملFaceplant: Impression (Mis)management in Facebook Status Updates
While recent research examined the impressions projected by users of Social Network Sites through their relatively static online profiles, the addition of status updates to Facebook offers the opportunity to study a more fluid type of impression management. In this paper, we take a first look at data collected with a custom application designed to capture the impressions both “given” and “given...
متن کاملOnline HVAC-Aware Occupancy Scheduling with Adaptive Temperature Control
Heating, ventilation and air-conditioning (HVAC) is the largest consumer of electricity in commercial buildings. Consumption is impacted by group activities (e.g. meetings, lectures) and can be reduced by scheduling these activities at times and locations that minimize HVAC utilization. However, this needs to preserve occupants’ thermal comfort and be responsive to dynamic information such as n...
متن کاملExploiting Locality of Churn for FIB Aggregation
Snapshots of the Forwarding Information Base (FIB) in Internet routers can be compressed (or aggregated) to at least half of their original size, as shown by previous studies. In practice however, the permanent stream of updates to the FIB due to routing updates complicates FIB aggregation: keeping an optimally aggregated FIB in face of these routing updates is algorithmically challenging. A se...
متن کامل